Abstract

In these introductory notes for 'pedestrians' we describe the current 
state of the art in the science of complex networks. 
1 The birth of network science

In 1735, in St Petersburg, Leonhard Euler solved the so-called Königsberg
bridge problem, a problem about walks on a simple small graph. This solution
(actually, a proof) is usually considered the starting point of the science
of networks.¹

2 What are random networks?

Random networks are statistical ensembles of graphs. A statistical ensemble
is a set of members (particular graphs), each of which has its own
probability of realization, that is, its statistical weight.

In empirical studies, as a rule, a single member (a particular realization) 
of this ensemble is observed. In simulations, a finite number of realizations 
of the ensemble may be obtained. 

As is standard in statistical mechanics, statistical ensembles are classified 
as equilibrium or non-equilibrium. In our case, these are equilibrium and 
non-equilibrium (e.g., growing) random networks. 

3 Adjacency matrix 

The complete description of a particular graph is provided by its adjacency
matrix. A graph of N vertices has an N × N adjacency matrix. Each element
$a_{ij}$ of the adjacency matrix is equal to the number of edges connecting
the vertices i and j.
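
As a minimal sketch of this definition, the adjacency matrix can be built
from a list of vertex pairs. The function name `adjacency_matrix` and the
treatment of self-loops (counted once on the diagonal) are assumptions of
this example, not conventions taken from the text.

```python
def adjacency_matrix(n, edges):
    """Return the n x n adjacency matrix of an undirected multigraph.

    a[i][j] counts the edges joining vertices i and j, so a pair that is
    repeated in `edges` gives an entry larger than one.
    """
    a = [[0] * n for _ in range(n)]
    for i, j in edges:
        a[i][j] += 1
        if i != j:  # assumption: a self-loop is counted once on the diagonal
            a[j][i] += 1
    return a


edges = [(0, 1), (0, 1), (1, 2)]  # a double edge 0-1 and a single edge 1-2
a = adjacency_matrix(3, edges)
for row in a:
    print(row)
```

The matrix is symmetric because the edges are undirected; for a directed
network one would fill only `a[i][j]`.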

4 Degree distribution 

The simplest local characteristic of a vertex is its degree: the total number 
of the edges attached to a vertex, that is the total number of the nearest 
neighbours of a vertex. 

In directed networks, the number of incoming/outgoing edges of a vertex 
is called its in-/out-degree. 

In a random network, the degree distribution is the average fraction of
vertices of degree k: $P(k) = \langle N(k) \rangle / N$. Here $N(k)$ is the
number of vertices of degree k in a particular graph of the statistical
ensemble. The averaging is over the entire statistical ensemble.

An empirical researcher measures $N(k)/N$ for a single realization of the
statistical ensemble. Simulations usually allow one to average $N(k)/N$ over
a finite set of realizations of the statistical ensemble.
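
The averaging over a finite set of realizations may be sketched as follows.
The function names and the representation of each realization as a list of
vertex pairs are assumptions made for this illustration.

```python
from collections import Counter


def degree_counts(n, edges):
    """N(k) for one realization: how many vertices have each degree k."""
    deg = [0] * n
    for i, j in edges:
        deg[i] += 1
        deg[j] += 1
    return Counter(deg)


def degree_distribution(n, realizations):
    """Average of N(k)/N over the given realizations (all with n vertices)."""
    total = Counter()
    for edges in realizations:
        total.update(degree_counts(n, edges))
    norm = n * len(realizations)
    return {k: count / norm for k, count in sorted(total.items())}


# Two realizations on three vertices: a path and a triangle.
realizations = [[(0, 1), (1, 2)], [(0, 1), (1, 2), (0, 2)]]
print(degree_distribution(3, realizations))
```

With a single realization in the list, the same function returns what an
empirical researcher would measure.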

¹ For recent reviews and reference books, see Refs. [1-9].
5 What are simple networks? Classical random graphs

The simplest random networks are the so-called classical random graphs
(Solomonoff and Rapoport, 1950-1952; Erdős and Rényi, 1959, 1960; Gilbert,
1959). In simple terms, these are maximally random networks under the
constraint that the mean degree of their vertices, $\langle k \rangle$, is
fixed. (We assume that the number of vertices in these graphs is also
fixed.) Maximum randomness means the maximum entropy of a random net.

There are two main versions of classical random graphs: 

The Erdős-Rényi model (a widespread term) is a statistical ensemble
of all possible graphs of precisely N vertices and precisely L edges, where
each member of the ensemble has equal probability of realization.

In the Gilbert model, each pair of N vertices is connected with some
probability p. This produces a statistical ensemble of all possible graphs
of N vertices. The members of this ensemble carry statistical weights. In
the thermodynamic limit (infinitely large networks), these two versions are
equivalent, with $\langle k \rangle = p(N - 1)$.

The degree distribution of classical random graphs has a Poisson form:
$P(k) = e^{-\langle k \rangle} \langle k \rangle^k / k!$. Here
$\langle k \rangle$ is fixed as $N \to \infty$. This is an extremely rapidly
decreasing distribution with a natural scale $k \sim \langle k \rangle$. All
its moments converge.
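
A rough numerical check of the Poisson form is sketched below for a single
realization of the Gilbert model. The size N = 1000, the mean degree 4 and
the random seed are arbitrary choices for illustration; the agreement is
only approximate at finite N.

```python
import math
import random


def gilbert_graph(n, p, rng):
    """Connect each of the n(n-1)/2 vertex pairs independently with probability p."""
    return [(i, j) for i in range(n) for j in range(i + 1, n) if rng.random() < p]


def poisson_pmf(k, mean):
    """Poisson probability exp(-mean) * mean**k / k!."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


def empirical_degree_distribution(n, edges):
    """Fraction of vertices of each degree, N(k)/N, for one realization."""
    deg = [0] * n
    for i, j in edges:
        deg[i] += 1
        deg[j] += 1
    return {k: deg.count(k) / n for k in sorted(set(deg))}


n, mean_k = 1000, 4.0
rng = random.Random(1)  # fixed seed, chosen arbitrarily
edges = gilbert_graph(n, mean_k / (n - 1), rng)
measured = empirical_degree_distribution(n, edges)
for k, fraction in measured.items():
    print(k, round(fraction, 4), round(poisson_pmf(k, mean_k), 4))
```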

6 The birth of the giant component 

The limit with a fixed $\langle k \rangle$ as $N \to \infty$ corresponds to
a sparse network. In a sparse net, the mean number of connections of a
vertex is much less than the number of connections of a vertex in a fully
connected graph.

Why is the case of a sparse network the most interesting? The important
feature of a network is its giant connected component. This is a set of
mutually reachable vertices containing a finite fraction of the vertices of
a large network. Without the giant connected component, a net is only a set
of small separated clusters. It turns out that in classical random graphs,
the giant connected component exists if the mean number of connections
of a vertex exceeds one. So this characteristic point of a network, the
point of 'the birth of the giant connected component', lies in the range
of extremely low degrees, $\langle k \rangle \sim 1 \ll N$.
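
The birth of the giant connected component can be observed in a small
simulation. The sketch below is an illustration under stated assumptions:
edges are drawn as uniformly random vertex pairs with replacement (so rare
repeated pairs are possible), and the sizes, mean degrees and seed are
arbitrary.

```python
import random


def largest_component_fraction(n, edges):
    """Fraction of vertices in the largest connected component (union-find)."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[ri] = rj

    sizes = {}
    for v in range(n):
        root = find(v)
        sizes[root] = sizes.get(root, 0) + 1
    return max(sizes.values()) / n


def random_graph(n, mean_k, rng):
    """Sparse random graph with L = n * mean_k / 2 uniformly chosen vertex pairs.

    Assumption: pairs are drawn with replacement, so rare repeated pairs may
    occur; self-pairs are rejected.
    """
    num_edges = int(n * mean_k / 2)
    edges = []
    while len(edges) < num_edges:
        i, j = rng.randrange(n), rng.randrange(n)
        if i != j:
            edges.append((i, j))
    return edges


rng = random.Random(7)  # arbitrary fixed seed
n = 20000
for mean_k in (0.5, 1.0, 1.5, 3.0):
    fraction = largest_component_fraction(n, random_graph(n, mean_k, rng))
    print(mean_k, round(fraction, 3))
```

Below a mean degree of one the largest component should hold a vanishing
fraction of the vertices, while above it the fraction is finite.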

7 Topology of the Web 

Directed networks are networks with directed edges. As Fig. 1 shows, these
networks generally have a far richer global structure than undirected
ones. There are several types of giant connected components in directed nets.
The core of a directed network is its giant strongly connected component,
which consists of vertices mutually reachable by directed paths.

Figure 1: The global structure of a directed network. The giant weakly
connected component (GWCC), which is the giant connected component of the
undirected projection of this network, contains: (1) the giant strongly
connected component (GSCC), (2) the giant in-component, (3) the giant
out-component, and (4) 'tendrils'.

Note that the scheme in Fig. 1 is applicable, in general, to any directed 
network, including lattices. In particular, the WWW has this global struc- 
ture [10]. 

8 Uncorrelated networks 

In principle, connected vertices (i.e., vertices, for which a connecting path 
exists) may be correlated. The examples of these correlations are (1) loops 
and (2) correlations between degrees of connected vertices (e.g., the nearest 
neighbours). 

Evidently, in large classical random graphs, correlations are absent: de- 
grees of connected vertices are uncorrelated, and loops are not essential in 
the large network limit. Such (equilibrium) random networks are called 
uncorrelated. 

The fact that loops are not essential in the thermodynamic limit implies 
that any finite neighbourhood of a vertex has a tree-like structure. 
9 What are small worlds?



Accounting for the last circumstance allows us to easily estimate the 'linear
size' of a classical random graph, that is, the mean length of the shortest
path between two vertices. Evidently, the mean number of the n-th nearest
neighbours of a vertex grows rapidly as $\langle k \rangle^n$. So, the
average shortest-path length $\ell$ is roughly estimated by the condition
$\langle k \rangle^{\ell} \sim N$. Then,

$$\ell \approx \frac{\ln N}{\ln \langle k \rangle}. \qquad (1)$$

This formula is asymptotically exact in classical random graphs. It also 
works well in random networks, where degree distributions rapidly decrease 
with k. 

Compare formula (1) with the expression $\ell \sim N^{1/d}$ for the linear
size of a d-dimensional lattice. In networks, the size dependence $\ell(N)$
is slower than in any finite-dimensional lattice or fractal.

The growth of $\ell(N)$ slower than any positive power of N is called the
small-world effect. By definition, a network is a small world if it shows
the small-world effect. One can see that small worlds are
infinite-dimensional objects. This feature is a basic property of networks.
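
The mean shortest-path length of a given graph can be measured by
breadth-first search and compared with the estimate $\ln N / \ln \langle k
\rangle$. The following is a minimal sketch; the helper names are
hypothetical, and the six-vertex ring is used only as a tiny lattice-like
test case.

```python
import math
from collections import deque


def adjacency_lists(n, edges):
    """Neighbour lists of an undirected graph given as vertex pairs."""
    adj = [[] for _ in range(n)]
    for i, j in edges:
        adj[i].append(j)
        adj[j].append(i)
    return adj


def mean_shortest_path(adj, sources):
    """Mean shortest-path length from each source to every reachable vertex."""
    total, pairs = 0, 0
    for s in sources:
        dist = {s: 0}
        queue = deque([s])
        while queue:
            v = queue.popleft()
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs


def estimate(n, mean_k):
    """The small-world estimate ln N / ln <k>."""
    return math.log(n) / math.log(mean_k)


ring = adjacency_lists(6, [(i, (i + 1) % 6) for i in range(6)])
print(mean_shortest_path(ring, range(6)))  # a ring is a 1-d lattice
print(estimate(10**6, 10))                 # a million vertices, mean degree 10
```

For large graphs one would pass only a random sample of vertices as
`sources`, since a full all-pairs computation costs one search per vertex.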



10 Real networks are mesoscopic objects

Formally speaking, it is hard to make a solid conclusion that a given
real-world network displays a small-world effect. The reason is that, as a
rule, real networks are small (small numbers of vertices): there are
10^-10^ vertices in most empirically studied biological networks, about
2 x 10^ vertices in the Internet at the Autonomous Systems level, several
hundred thousand routers in the Internet, and 'only' about 10^° pages in
the WWW.

These numbers are not large enough to check the small-world effect.
Moreover, these numbers are not large enough to treat networks as
macroscopic systems, where the measurement of a small fraction of a system
allows one to arrive at complete conclusions about the entire system.

Real networks are mesoscopic objects.² That is, a whole system (or,
at least, its essential part) must be explored to arrive at 'complete'
knowledge of the system. Note that, as a rule, the information about a whole
real-world network (a full map, a complete adjacency matrix) is available.
Usually, the complete map of a network is the starting point of the analysis
of empirical data.

² A 'microscopic' network is an edge connecting two vertices.



11 What are complex networks?

Complex random networks are networks which are more complex than classical
random graphs. Here the term 'complex' means a more complex organization
(a more complex distribution of connections). In particular, a degree
distribution may be more complex than Poisson, and/or various correlations
may be essential.

Real-world nets are complex networks, usually with fat-tailed degree
distributions, usually with strong correlations between the degrees of
connected vertices, and usually with an essential role of loops.



12 The configuration model 

The configuration model (the term was introduced by B. Bollobás) is the
first natural generalization of classical random graphs. In very simple
terms, the configuration model is a maximally random graph with a given
degree distribution $P(k)$.

This complex random equilibrium network (recall that it is an ensemble!) is
uncorrelated. Most results for complex networks are obtained by using the
configuration model.³
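
One common way to sample a member of this ensemble is to give each vertex
as many edge ends ('stubs') as its prescribed degree and to pair the stubs
at random. The sketch below assumes that self-loops and multiple edges are
allowed; the function name and the example degree sequence are hypothetical.

```python
import random
from collections import Counter


def configuration_model(degrees, rng):
    """Pair up edge ends ('stubs') uniformly at random.

    Vertex v contributes degrees[v] stubs. Self-loops and multiple edges
    are allowed here (an assumption of this sketch).
    """
    if sum(degrees) % 2:
        raise ValueError("the sum of degrees must be even")
    stubs = [v for v, k in enumerate(degrees) for _ in range(k)]
    rng.shuffle(stubs)
    return list(zip(stubs[0::2], stubs[1::2]))


degrees = [3, 2, 2, 1, 1, 1]
edges = configuration_model(degrees, random.Random(0))
realized = Counter(v for edge in edges for v in edge)  # a self-loop counts twice
print(edges)
```

Every realization reproduces the prescribed degree sequence exactly; only
the pairing of the stubs is random.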



13 The absence of degree-degree correlations

What does this mean? In particular, this means that the degrees of the
nearest-neighbour vertices are uncorrelated. That is, the joint probability
$P(k, k')$ that an edge connects vertices of degrees k and k' is

$$P(k, k') = \frac{k P(k)\, k' P(k')}{\langle k \rangle^2}. \qquad (2)$$

Indeed, each end of an edge may, with equal probability, occupy any of the
$2L = N \langle k \rangle = N \sum_k k P(k)$ possible positions. So, an end
of an edge in an uncorrelated network is attached to a vertex of degree k
with probability $k P(k) / \langle k \rangle$.

³ There is another way to build ensembles of networks, one that is more
traditional for statistical mechanics. Sometimes, it is referred to as the
exponential model. The members of the statistical ensemble in this
construction are systems (sets) of local configurations of vertices and
edges. Each kind of these clusters ('bricks') has its 'excitation energy'.
By thermal excitation one can obtain a full set of realizations (networks)
of the ensemble. The specific excitation energies determine the statistical
weights of these realizations, that is, the structure of the resulting
random network.



14 Networks with correlated degrees 

The simplest case of degree-degree correlations is correlations between the
degrees of the nearest-neighbour vertices. If these correlations are present
in a network, the joint distribution $P(k, k')$ differs from the right-hand
side of relation (2).

The configuration model may be generalized to include these correlations.
The resulting correlated network is a maximally random graph under the
restriction that the joint degree-degree distribution $P(k, k')$ is equal to
a given function. (The degree distribution follows from $P(k, k')$ and so is
also fixed.)

The only type of correlations in the resulting network are correlations 
between the degrees of the nearest-neighbour vertices. So, the network also 
has a tree-like local structure. 

Proceeding in this way one can construct networks with more and more 
complex correlations between degrees of connected vertices. 

15 Clustering 

Loops are specific correlations in networks. The notion of clustering is
related to loops of length three (triangles of edges). The local clustering
is the relative number of connections between the nearest neighbours of a
vertex i:

$$C_i = \frac{n_i}{k_i (k_i - 1)/2}. \qquad (3)$$

Here $k_i$ is the degree of the vertex and $n_i$ is the total number of
connections between its nearest neighbours. Averaging $C_i$ over vertices of
degree k provides the degree-dependent local clustering $C(k)$, which shows
the probability that two nearest neighbours of a vertex of degree k are
connected to each other.

The mean clustering is defined as

$$\langle C \rangle = \sum_k P(k)\, C(k). \qquad (4)$$

Another clustering characteristic, the clustering coefficient, is defined as

$$C = \frac{\langle n_i \rangle}{\langle k_i (k_i - 1)/2 \rangle}
    = \frac{\langle n_i \rangle}{(\langle k^2 \rangle - \langle k \rangle)/2}. \qquad (5)$$

The clustering coefficient is three times the ratio of the total number of
edge triangles to the total number of connected triples of vertices. Note
that if the local clustering is degree-dependent, $\langle C \rangle \neq C$.
The relative difference may be great in real-world networks.

As is natural, in infinitely large uncorrelated networks, clustering is
absent. So, in uncorrelated networks, clustering is only a finite-size
effect. For example, in classical random graphs,

$$C(k) = C = \langle C \rangle \cong \frac{\langle k \rangle}{N}. \qquad (6)$$

In the configuration model,

$$C(k) = C = \langle C \rangle \cong
  \frac{(\langle k^2 \rangle - \langle k \rangle)^2}{N \langle k \rangle^3}. \qquad (7)$$

In networks without degree-degree correlations, the local clustering is
degree-independent, and $C = \langle C \rangle$, but this is a rare
exception. Formulae for $C(k)$, $C$, and $\langle C \rangle$ in networks
with degree-degree correlations are given in Ref. [11].

We believe that empirical data on clustering are usually determined by
the form of $P(k)$ and $P(k, k')$. So, sufficiently strong clustering may be
explained by using formula (7) and the corresponding expressions for
networks with degree-degree correlations, without invoking a specific
mechanism of strong clustering.
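
The difference between the mean clustering and the clustering coefficient
can be seen on a very small graph. The sketch below assumes a simple graph
(no self-loops or multiple edges) and assigns zero local clustering to
vertices of degree 0 or 1, which is a convention chosen for this example.

```python
from itertools import combinations


def clustering(n, edges):
    """Return (local clustering list, mean clustering, clustering coefficient).

    Assumes a simple graph. Vertices of degree 0 or 1 are given local
    clustering 0, which is a convention, not a result.
    """
    nbrs = [set() for _ in range(n)]
    for i, j in edges:
        nbrs[i].add(j)
        nbrs[j].add(i)

    local, links_total, pairs_total = [], 0, 0
    for v in range(n):
        k = len(nbrs[v])
        pairs = k * (k - 1) // 2  # number of neighbour pairs of vertex v
        # number of edges among the neighbours of v
        links = sum(1 for a, b in combinations(nbrs[v], 2) if b in nbrs[a])
        local.append(links / pairs if pairs else 0.0)
        links_total += links
        pairs_total += pairs

    mean_c = sum(local) / n
    coeff = links_total / pairs_total if pairs_total else 0.0
    return local, mean_c, coeff


# A triangle 0-1-2 with a pendant vertex 3 attached to vertex 0.
local, mean_c, coeff = clustering(4, [(0, 1), (1, 2), (0, 2), (0, 3)])
print(local, mean_c, coeff)
```

Here the graph has one triangle and five connected triples, so the
clustering coefficient is 3/5, while the mean of the local values is 7/12.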



16 What are small-world networks?

Nevertheless, one has to admit that, as a rule, real-world networks have
really strong clustering. Moreover, the values of the clustering coefficient
are so high in some networks that it is hard to believe that this is a
finite-size effect. Watts and Strogatz proposed a specific class of complex
networks which show the small-world effect. They named them small-world
networks. These are lattices with high clustering (e.g., a trigonal
lattice), where randomly chosen vertices are connected by long-range
shortcuts.

Actually, a small-world network is a superposition of a lattice and a
classical random graph. Due to the strong clustering of the mother lattice,
a small-world network has high clustering. Due to the compactness of the
classical random graph, a small-world network is compact.

17 'Small worlds' is not the same as 'small-world networks' 

Small-world networks are very specific graphs. In contrast, small worlds,
that is, networks with a small-world effect, form an incredibly wide class
of networks: practically all networks which we discuss. These networks are
more compact than any finite-dimensional lattice.

18 Fat-tailed degree distributions 

Actually, the properties of classical random graphs do not differ
tremendously from those of infinite-dimensional lattices. However, if a
network has a sufficiently slowly decreasing degree distribution, as in
most important real-world networks, the difference is striking.

Usually researchers try to fit empirical degree distributions by specific
power-law dependences $P(k) \propto k^{-\gamma}$ (scale-free degree
distributions). However, a far more important (and reliable) observed fact
is that the higher moments of the empirical degree distributions diverge in
large networks. This observation shows that, with noticeable probability,
vertices of high degree are present in real networks, unlike in classical
random graphs. It is this presence that produces strong effects.

19 Reasons for the fat-tailed degree distributions 

The main explanations of the fat-tailed form of empirical degree
distributions are as follows:

(1) Self-organization (or, rather, self-organized criticality): while
evolving, a network self-organizes into a structure with an essential role
of hubs.

(2) Optimization processes involving many agents: vertices arrange their
connections in the optimal way. Actually this means an extensive
competition of trade-offs, where each vertex 'tries to find' the optimal
combination of numerous (often mutually contradictory) factors. In other
words, everybody tries to arrange his or her connections in the best way,
taking into account different factors; everybody tries to make the best
choice.

(3) Multiplicative stochastic processes may produce slowly decreasing
distributions. (In multiplicative stochastic processes, variables change by
random factors with time, that is, the changes are relative.)

(4) The interconnection of geographically close vertices may result in net- 
work architectures with fat-tailed degree distributions, Refs. [12, 13]. 

(5) Fat tails of degree distributions may emerge as a secondary effect. Sup- 
pose that connections in a net are determined by a set of some intrinsic 
properties of vertices ('hidden variables'). The statistics of these hidden 
variables is explained by 'external' reasons. Specific forms of the distri- 
butions of hidden variables result in slowly decreasing distributions of a 
vertex degree, see Refs. [14-18]. 

Note that this classification is rather conventional. Rigid boundaries 
between explanations (1), (2), and (3) are absent. 

20 Preferential linking 

Perhaps the most popular self-organization mechanism is preferential
attachment (preferential linking): vertices of high degree attract new
connections with higher probability. In more precise terms, the probability
that a new edge becomes attached to a vertex with k connections is
proportional to some function of k, a preference function, $f(k)$.⁴ The
resulting structure of an evolving net is determined by the form of this
function.

Scale-free degree distributions may emerge only if the function $f(k)$ is
linear, that is, the probability of attachment is
$(k + A)/(\langle k \rangle + A)$, where A is some constant. It seems that
this is a widespread situation in real networks. This form of preference
usually produces $\gamma$ exponents between 2 and infinity.

Models of evolving systems based on this concept were proposed by
G.U. Yule (1925) and H.E. Simon (1955). This idea was applied to growing
networks by D.J. de S. Price (1976), with a linear preference function, and
by A.-L. Barabási and R. Albert (1999), with a proportional preference
function.

In particular, in the Barabási-Albert model, a growing network is a
so-called 'citation graph'. This means that new connections emerge only
between a new vertex and existing ones. At each time step, a new vertex is
added to the net and becomes attached to one or several vertices, which are
chosen with proportional preference.

⁴ In 'inhomogeneous networks', the form of the preference function varies
from vertex to vertex.
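
A growing citation graph with proportional preference may be sketched as
follows. This is an illustration rather than the authors' exact procedure:
the seed graph (a complete graph on m + 1 vertices), the requirement that
the m targets be distinct, and all names are assumptions of the example.

```python
import random


def barabasi_albert(n, m, rng):
    """Grow a citation graph: each new vertex attaches to m existing vertices
    chosen with probability proportional to their degree.

    Assumptions of this sketch: the seed is a complete graph on m + 1
    vertices, and the m targets of a new vertex are distinct.
    """
    edges = [(i, j) for i in range(m + 1) for j in range(i + 1, m + 1)]
    # Each vertex appears in `ends` once per attached edge end, so a uniform
    # draw from `ends` is a draw proportional to degree.
    ends = [v for edge in edges for v in edge]
    for new in range(m + 1, n):
        targets = set()
        while len(targets) < m:
            targets.add(rng.choice(ends))
        for t in targets:
            edges.append((new, t))
            ends.extend((new, t))
    return edges


edges = barabasi_albert(2000, 2, random.Random(5))  # arbitrary size and seed
degree = {}
for i, j in edges:
    degree[i] = degree.get(i, 0) + 1
    degree[j] = degree.get(j, 0) + 1
print(max(degree.values()), sum(degree.values()) / len(degree))
```

Keeping the list of edge ends avoids recomputing degrees at every step: a
uniform draw from that list is automatically proportional to degree.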

21 Condensation of edges 

The preferential linking mechanism works effectively only in
non-equilibrium networks. In stricter terms, only in non-equilibrium (e.g.,
growing) networks does linear preferential linking necessarily lead to
scale-free architectures at any value of the mean degree. In equilibrium
networks, even linear preference produces fat-tailed distributions only
above some critical value of the mean degree.

Below this point, that is, in a sparser network, the degree distribution
is a rapidly decreasing function. Above this point, the condensation of
edges takes place. In other words, a finite fraction of edges turns out to
be attached to a vanishingly small fraction of vertices, or even to a single
vertex. The degree distribution for the remaining vertices is fat-tailed.

Strong inhomogeneity of a network also can lead to the condensation of 
edges. 

22 Cut-offs of degree distributions 

Clearly, in a finite size network, vertices of an infinitely large degree are 
absent. This means the presence of a size-dependent cut-off in the degree 
distribution, so that 'perfect' scale-free degree distributions are impossible. 
In small networks, a cut-off obstructs the observation of fat-tailed distribu- 
tions. 

The position of the cut-off depends on the details of a network. E.g., in
scale-free citation graphs of size N, the cut-off is
$k_{\text{cut}} \sim N^{1/(\gamma - 1)}$; in other situations,
$k_{\text{cut}}$ may be of the order of $N^{1/2}$, etc.

23 Reasons for correlations in networks 

The main reasons for correlations in networks are as follows: 

(1) Degree-degree correlations are the immediate result of the evolution of
a network. Among evolving networks, there are only a few exceptions where
the degrees of the nearest-neighbour vertices are weakly correlated. (One
of these exceptions is the Barabási-Albert model.)

(2) A maximally random network is inevitably uncorrelated if multiple
connections and loops of length one are allowed. However, if multiple
connections and one-loops are forbidden, a maximally random network with
a fat-tailed degree distribution may show strong correlations between the
degrees of the nearest neighbours [19].

(3) Projections of uncorrelated multipartite graphs are correlated networks.

Let us explain the third possibility in more detail. Collaborations may 
be naturally described by so-called bipartite graphs, where the vertices of 
the first kind show collaborators, the vertices of the other type show the 
acts of collaboration, and edges connect each act of collaboration to all 
collaborators participating in this act. 

The same collaboration can be depicted by using only one type of ver- 
tices, namely collaborators. In this representation, two vertices are con- 
nected if they participate in at least one act of collaboration. The resulting 
graph is a one-mode projection of the bipartite collaboration network. 

The basic formal construction of a bipartite random network is a gener- 
alized configuration model, which is a direct generalization of the one-partite 
configuration model. In simple terms, this is a maximally random bipartite 
graph with two given degree distributions for both types of vertices. This 
is an uncorrelated bipartite network. However, its one-mode projection is a 
correlated network, which may have strong clustering, and may have degree- 
degree correlations. 

The last circumstance explains, in particular, why real (one-mode)
collaboration networks have strong clustering.
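
The one-mode projection described above is easy to write down explicitly.
In the sketch below the bipartite graph is given as a mapping from acts of
collaboration to their participants; this representation, the function name
and the sample data are all hypothetical.

```python
from itertools import combinations


def one_mode_projection(acts):
    """Project a bipartite collaboration graph onto collaborators.

    `acts` maps each act of collaboration to the collaborators taking part.
    Two collaborators are joined if they share at least one act.
    """
    edges = set()
    for members in acts.values():
        for a, b in combinations(sorted(set(members)), 2):
            edges.add((a, b))
    return sorted(edges)


# Hypothetical data: two papers and their authors.
acts = {"paper_1": ["A", "B", "C"], "paper_2": ["C", "D"]}
projection = one_mode_projection(acts)
print(projection)
```

Every act with three or more participants becomes a fully connected group
in the projection, which is one way to see where the triangles come from.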

24 Classical random graphs cannot be used for comparison 
with real networks 

We have again mentioned the configuration model, which is the simplest, 
basic complex random network. This model is well studied. If a degree 
distribution is fat tailed, the configuration model shows effects, crucially 
different from those of classical random graphs. Real networks are usually 
so different from classical random graphs that any comparison of real-world, 
complex networks with classical random graphs is meaningless. Instead, as a 
starting point, one must compare empirical data on a real network with the 
simplest complex uncorrelated network with the same degree distribution. 
This is the configuration model. 
25 How to measure degree-degree correlations

When an empirical researcher studies the statistics of vertex degree in a
network, that is, measures a degree distribution in the range
$k < k_{\max}$, he or she has data for these $k_{\max}$ points, obtained
from inspection of N vertices. When an empirical researcher studies the
statistics of the correlations between the nearest neighbours in a sparse
network, that is, the joint degree-degree distribution $P(k', k'')$, he or
she has data for a much larger number, $k_{\max}^2$, of points, obtained
from only $\sim N$ edges. So, the statistics of the empirical joint
degree-degree distribution are inevitably poor, and fluctuations will be
strong on the resulting plot. Instead, empirical researchers usually
describe these correlations by using a coarser, but less fluctuating,
characteristic. This is the mean degree $k_{nn}(k)$ of the nearest
neighbours of a vertex of degree k. This characteristic can be easily
expressed in terms of the joint degree-degree distribution.⁵
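
The mean degree of the nearest neighbours as a function of vertex degree
can be measured directly from an edge list. The following is a minimal
sketch for an undirected graph; the function name and the star-graph
example are assumptions of this illustration.

```python
from collections import defaultdict


def mean_neighbour_degree(n, edges):
    """Return {k: average degree of the nearest neighbours of degree-k vertices}.

    Assumes an undirected graph given as a list of vertex pairs.
    """
    deg = [0] * n
    for i, j in edges:
        deg[i] += 1
        deg[j] += 1

    nbr_degree_sum = [0] * n
    for i, j in edges:
        nbr_degree_sum[i] += deg[j]
        nbr_degree_sum[j] += deg[i]

    sums, counts = defaultdict(float), defaultdict(int)
    for v in range(n):
        if deg[v] > 0:
            sums[deg[v]] += nbr_degree_sum[v] / deg[v]
            counts[deg[v]] += 1
    return {k: sums[k] / counts[k] for k in sorted(sums)}


star = [(0, 1), (0, 2), (0, 3)]  # hub 0 with three leaves
print(mean_neighbour_degree(4, star))
```

In the star, the hub has only low-degree neighbours and the leaves have
only the hub as a neighbour, so the measured function decreases with k.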

26 Assortative and disassortative mixing 

Situations where vertices of high degree mostly have nearest neighbours
of high degree, i.e. where $k_{nn}(k)$ grows with k, are called assortative
mixing (this term is taken from sociology).

The opposite case, where vertices of high degree mostly have nearest
neighbours of low degree, i.e. where $k_{nn}(k)$ decreases with k, is
called disassortative mixing.

For example, the Web and the Internet graph at the Autonomous Systems
level show disassortative mixing. The Internet at the router level
has weak degree-degree correlations. Collaboration networks usually show
assortative mixing.

27 Disassortative mixing does not mean that vertices of high 
degrees rarely connect to each other 

Rather counterintuitively, in a network where highly connected vertices 
mostly have neighbours with few connections, vertices of high degrees may 
turn out to be interconnected with high probability. 

This claim is illustrated by the following estimate. In a correlated
unipartite network, the average number of edges between vertices of degrees
k' and k'' is $N(k', k'') = \langle k \rangle N P(k', k'')$. The maximum
possible number of connections between these vertices of degrees k' and k''
is $N(k') N(k'') = N P(k')\, N P(k'')$ (multiple connections are ignored).
So, the fraction
$N(k', k'')/[N(k') N(k'')] = [\langle k \rangle P(k', k'')]/[N P(k') P(k'')]$
of the possible connections is present. In simple terms, this is the
probability that an edge between a pair of vertices of degrees k' and k''
is present. For instance, in an uncorrelated network, the resulting number
is $k' k'' / (N \langle k \rangle)$, which may be large enough at large
degrees. This is the simplest case, but evidently, similar conclusions are
valid for both the assortative and disassortative types of correlations.

⁵ We, however, recommend that hindering fluctuations be reduced by using
cumulative distributions:
$P_{\text{cum}}(k', k'') = \sum_{q' \geq k',\, q'' \geq k''} P(q', q'')$.

For example, in the Internet, the set of Autonomous Systems with high- 
est numbers of connections is practically fully interconnected. 

28 Reciprocal links in directed networks 

For simplicity, in our discussions we neglect one detail: multiple
connections. However, sometimes the presence of multiple connections may be
important. Often, multiple connections in networks are considered as
something exotic. The counterexample is the WWW, a directed network
where about 30 per cent of hyperlinks have oppositely directed counterparts.
That is, if one page has a reference to another page, the latter refers to
the former with high probability. The same occurs in directed email networks.

29 Ultra-small-world effect 

In an uncorrelated network with an arbitrary degree distribution $P(k)$, the
degree distribution of the nearest neighbour of a vertex is
$k P(k)/\langle k \rangle$, which is quite different from $P(k)$.⁶ This
principal difference is the origin of many effects in complex networks.

The mean degree of the nearest neighbour of a vertex is
$\langle k^2 \rangle/\langle k \rangle$, which is greater than the mean
degree of a vertex in the network. This circumstance changes formula (1)
for the mean shortest-path length to the following one:

$$\ell \approx \frac{\ln N}{\ln[(\langle k^2 \rangle/\langle k \rangle) - 1]}. \qquad (8)$$

One can see that if the second moment of the degree distribution diverges
in the infinite network limit, the average number of the second nearest
neighbours of a vertex approaches infinity, and formula (8) cannot be used.
In this situation, $\ell(N)$ grows with N slower than $\ln N$, which may be
called the 'ultra-small-world' effect.

⁶ A similar effect takes place in uncorrelated networks.
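
To see how a few hubs shorten the estimate of formula (8), one may evaluate
it for two degree sequences of the same size. The sketch below is only an
arithmetic illustration; the function names and both degree sequences are
hypothetical, and the formula is applied as if the network were uncorrelated.

```python
import math


def neighbour_mean_degree(degrees):
    """Ratio of the second to the first moment of the degree sequence:
    the mean degree of the vertex found at the end of a random edge."""
    return sum(k * k for k in degrees) / sum(degrees)


def path_length_estimate(degrees):
    """Formula (8): ln N divided by ln(neighbour mean degree minus one)."""
    branching = neighbour_mean_degree(degrees) - 1
    if branching <= 1:
        raise ValueError("the estimate needs a branching ratio above one")
    return math.log(len(degrees)) / math.log(branching)


regular = [3] * 1000           # every vertex has degree 3
hubs = [3] * 990 + [100] * 10  # same size, ten vertices turned into hubs
print(path_length_estimate(regular), path_length_estimate(hubs))
```

Ten hubs among a thousand vertices raise the second moment enough to cut
the estimated path length several-fold.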

30 The importance of the tree ansatz 

Formulae (1) and (8), and many other basic results were obtained by using 
the tree ansatz. This is an assumption that a sufficiently vast environment 
of each vertex of a network has a tree-like structure. In more precise terms, 
it is assumed that as the total number of vertices tends to infinity, any 
finite environment of a vertex almost surely does not contain loops. In this 
case, loops occur only if the remote environment of a vertex is added. In 
particular, the tree ansatz is valid for large uncorrelated networks. 

For many characteristics of a network, remote environments of vertices
are not important, and the simplifying tree ansatz may be used. Moreover,
some results obtained within the framework of the tree ansatz are still
valid for networks with numerous loops and strong clustering.

31 Ultraresilience against random failures 

If the second moment of the degree distribution diverges (this happens,
e.g., in infinite scale-free networks with $\gamma < 3$), the average degree
of the nearest neighbour of a vertex also diverges. This means that a vertex
in the infinite network has, on average, an infinite number of second
nearest neighbours. Evidently, this indicates the presence of the giant
connected component in the network.

Let us remove, at random, a finite fraction of vertices or edges from the
network (random failure). One can see that after this removal, the average
number of the second nearest neighbours of a vertex is still infinite. In
other words, the random removal of any finite fraction of vertices or edges
does not eliminate the giant connected component. In this situation, even
the random removal of 99.99 per cent of vertices or edges does not destroy
the 'core' of the network. That is, these networks are ultraresilient
against random failures.

32 When correlated nets are ultraresilient 

The above claims have been made for uncorrelated networks. Nevertheless,
one can show that in networks with correlations between the degrees of the
nearest neighbours, the average number of the second nearest neighbours of
a vertex diverges in the same situation, that is, when the second moment of
the degree distribution diverges.

So, the condition for ultraresilience against random failures is the same
for both correlated and uncorrelated networks: the second moment of
the degree distribution must diverge.

33 Vulnerability of complex networks 

On the other hand, it is the highly connected vertices that enable the
existence of the giant connected component in networks with fat-tailed
degree distributions. These are only a small fraction of the total number
of vertices. Hence, intentional damage to a network may have a strong
effect if one removes the vertices of the highest degrees. In this case, it
is sufficient to remove a small fraction of vertices to eliminate the giant
connected component.
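
The contrast between a random failure and an intentional attack is most
easily seen in an extreme toy case, a star graph. The sketch below is an
illustration only; the helper names are hypothetical, and a star is far
more hub-dominated than a real network.

```python
def largest_component(n, edges, removed=()):
    """Size of the largest connected component after deleting `removed` vertices."""
    removed = set(removed)
    adj = {v: [] for v in range(n) if v not in removed}
    for i, j in edges:
        if i in adj and j in adj:
            adj[i].append(j)
            adj[j].append(i)

    seen, best = set(), 0
    for start in adj:
        if start in seen:
            continue
        seen.add(start)
        stack, size = [start], 0
        while stack:  # depth-first traversal of one component
            v = stack.pop()
            size += 1
            for w in adj[v]:
                if w not in seen:
                    seen.add(w)
                    stack.append(w)
        best = max(best, size)
    return best


def highest_degree_vertices(n, edges, count):
    """The `count` vertices of highest degree (ties broken by vertex index)."""
    deg = [0] * n
    for i, j in edges:
        deg[i] += 1
        deg[j] += 1
    return sorted(range(n), key=lambda v: (-deg[v], v))[:count]


# Extreme toy case: a star with hub 0 and five leaves.
star = [(0, leaf) for leaf in range(1, 6)]
print(largest_component(6, star, removed=[3]))  # one leaf fails
print(largest_component(6, star, removed=highest_degree_vertices(6, star, 1)))  # hub attacked
```

Losing a leaf barely changes the largest component, whereas removing the
single hub leaves only isolated vertices.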

34 The absence of an epidemic threshold 

Another side of the ultraresilience against random failures is the absence
of the epidemic threshold under the same condition. That is, in simple
terms, any finite (nonzero) rate of infection of the nearest neighbours of
a vertex in a network with a diverging second moment of the degree
distribution leads to a 'global epidemic'.

Actually, the problems of the spread of diseases and random failures
(or the percolation problem) are closely related. A low infection rate in the
spread of diseases corresponds to a high fraction of removed vertices or edges
in the percolation problem. The pandemic corresponds to the presence of the
giant connected component in the randomly failed network. So, the absence
of the epidemic threshold corresponds to the impossibility of eliminating
the giant connected component of the network by random removal of vertices
or edges.

35 Search based on local information 

Another effect of the divergence of higher moments of the degree distribution
is a decrease in the characteristic time of the so-called 'local search' [20].
Suppose that each vertex contains full information about its nearest neighbours.
In the local search problem, one must find a vertex with some desired
information.

A possible search strategy looks as follows. Start from an arbitrary vertex,
and move along edges randomly, from vertex to vertex. Recall
that the mean degree of the nearest neighbour of a vertex is $\langle k^2 \rangle / \langle k \rangle$.
Since each visited vertex knows about that many neighbours, the
typical search time is estimated as $N / (\langle k^2 \rangle / \langle k \rangle)$. Consequently, this search
is quick in networks with a large second moment of the degree distribution.
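
The estimate is easy to evaluate for a given degree sequence. The sketch below is only an arithmetic illustration of the formula above (it does not simulate the walk); the function names and the two toy degree sequences are assumptions made for the example.

```python
def mean_neighbour_degree(degrees):
    """<k^2>/<k>: mean degree of a vertex reached by following a random edge."""
    return sum(k * k for k in degrees) / sum(degrees)

def local_search_time(degrees):
    """The estimate N / (<k^2>/<k>) for the number of steps of the search."""
    return len(degrees) / mean_neighbour_degree(degrees)

regular = [4] * 1000             # every vertex has degree 4
hubs = [2] * 990 + [200] * 10    # ten hubs, mean degree close to 4

print(local_search_time(regular))  # 250.0
print(local_search_time(hubs))
```

Both sequences have nearly the same mean degree, but the few hubs raise the second moment and shorten the estimated search time by more than an order of magnitude.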

36 Ultraresilience disappears in finite networks 

The ultraresilience against random failures and the absence of the epidemic 
threshold are determined by the divergence of the second moment of a degree 
distribution. This divergence is possible only in infinite networks. If a net 
is finite, the degree distribution necessarily has a size-dependent cut-off, 
and all the moments of the degree distribution are finite. In this case, 
the average number of the second-nearest neighbours of a vertex is finite, 
the giant connected component can be removed by random removal of a 
sufficiently large, but finite, fraction of vertices, and the epidemic threshold 
exists. 

Thus, for the observation of the ultraresilience and of the disappearance 
of the epidemic threshold, a network must be very large. Recall that as a 
rule, real networks are small. 
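
How slowly the threshold vanishes with the network size can be seen from a short numerical sketch. It rests on assumptions that are not stated in this section: a power-law degree distribution $P(k) \propto k^{-\gamma}$, a size-dependent cut-off of the form $k_{cut} = k_{min} N^{1/(\gamma-1)}$, and the criterion $\langle k \rangle / (\langle k^2 \rangle - \langle k \rangle)$ for the critical fraction of retained vertices in an uncorrelated network. All names are illustrative.

```python
def moments(gamma, k_min, k_cut):
    """First and second moments of P(k) ~ k^(-gamma) on k_min <= k <= k_cut."""
    ks = range(k_min, k_cut + 1)
    weights = [k ** -gamma for k in ks]
    norm = sum(weights)
    k1 = sum(k * w for k, w in zip(ks, weights)) / norm
    k2 = sum(k * k * w for k, w in zip(ks, weights)) / norm
    return k1, k2

def critical_retained_fraction(gamma, k_min, n):
    """<k> / (<k^2> - <k>) with the assumed cut-off k_min * n^(1/(gamma-1))."""
    k_cut = int(k_min * n ** (1.0 / (gamma - 1.0)))
    k1, k2 = moments(gamma, k_min, k_cut)
    return k1 / (k2 - k1)

sizes = (10**3, 10**5, 10**7)
thresholds = [critical_retained_fraction(2.5, 2, n) for n in sizes]
for n, q in zip(sizes, thresholds):
    print(n, q)
```

Under these assumptions the threshold decreases with growing N but stays nonzero for any finite size.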

37 Critical behavior of cooperative models on networks 

We have explained that networks are infinite-dimensional objects. Physicists
know that critical fluctuations in cooperative models on infinite-dimensional
objects are absent, and so the critical behavior is described by mean-field
theories.

So, the critical phenomena in networks should be described by mean-field 
theories. Indeed, in the case of equilibrium networks with degree distribu- 
tions with a well-defined scale, the standard mean-field theory, with standard 
critical exponents, is valid. 

On the other hand, if the degree distribution of an equilibrium network 
is fat-tailed, critical behavior is non-standard. This implies unusual values 
of critical exponents. Moreover, the order of a phase transition may be high 
and even approach infinity. Nonetheless, the critical fluctuations are absent 
even in this case, and a mean-field theory still works. The point is that
the mean-field behavior is non-standard because of the presence of highly 
connected vertices in the network. 

The above general claims are valid for various cooperative phenomena
(percolation, magnetic phase transitions, etc.) in various networks
(correlated and uncorrelated networks, small-world networks, etc.).*

38 Berezinskii-Kosterlitz-Thouless phase transitions in networks

There is an impressive exception to the above scenario. In some situations,
nonequilibrium networks may have a 'non-mean-field' phase transition
with the Berezinskii-Kosterlitz-Thouless singularity. Near this infinite-order
phase transition, the order parameter (e.g., the size of the giant component
in percolation-like problems) changes as $\exp(-\mathrm{const}/\sqrt{b - b_c})$. Here $b$ is
some control parameter, and $b_c$ is its critical value.

39 Cascading failures 

We have explained that self-organized-criticality mechanisms can produce
complex network architectures. On the other hand, one can study avalanches
and other phenomena associated with self-organized criticality on networks,
e.g., a sand-pile problem on networks, Ref. [21]. The impressive effect of
cascading failures in power grids explains the wide interest in avalanche
phenomena on networks.

In the simplest model [22], cascades on networks are related not to self-organized
criticality but rather to percolation problems and to the spread of
infections. The model illustrates the phenomenon of global cascades induced
by a local perturbation on networks. In this model, a vertex may be in one of two
states: A, which is the initial state, and B, which is the perturbed state. The
dynamics of the model is described by the following rule: a vertex adopts
state B if at least a fraction p of its nearest neighbours are in state B. Here
the threshold 0 < p < 1 is the parameter of the problem. Otherwise, the vertex
remains in state A. This process indeed resembles the spread of infections,
and the global cascades correspond to pandemics. It turns out that global cascades
are possible only in some restricted range of p and of the network structure
parameters.
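
The rule above can be sketched in a few lines. This is a minimal illustration, assuming that a vertex never returns from state B to state A (so the final set of B vertices does not depend on the update order); the function name `run_cascade` and the ring-shaped test network are choices made for the example.

```python
def run_cascade(adj, seeds, p):
    """Return the set of vertices in state B once no further vertex can switch."""
    state_b = set(seeds)
    changed = True
    while changed:
        changed = False
        for v, neighbours in adj.items():
            if v in state_b or not neighbours:
                continue
            share = sum(1 for w in neighbours if w in state_b) / len(neighbours)
            if share >= p:        # at least a fraction p of neighbours are in B
                state_b.add(v)
                changed = True
    return state_b

# A ring of 10 vertices: every vertex has exactly two nearest neighbours.
ring = {i: [(i - 1) % 10, (i + 1) % 10] for i in range(10)}

print(len(run_cascade(ring, {0}, p=0.5)))  # one perturbed neighbour out of two suffices
print(len(run_cascade(ring, {0}, p=0.6)))  # one out of two is below the threshold
```

On the ring, a single perturbed vertex triggers a global cascade for p = 0.5, while for p = 0.6 the perturbation stays local.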

40 Cliques and communities 

Cliques are fully connected subgraphs of a graph. Communities are (rather
poorly defined) subgraphs, where vertices are 'better' connected to each
other than the full set of the vertices of a network. For practical purposes,
it is often important to find the full set of communities in the network.
In principle, this problem has no unique solution. One of the numerous
approaches for indexing the communities is based on inspecting the structure
of the shortest paths in the network (M. Girvan and M.E.J. Newman).

* Note, however, a principal difference from the synchronization phenomenon:
synchronization is possible even in a system of two coupled oscillators, whereas
'normal' phase transitions are realised in infinite cooperative systems.

41 Betweenness 

How are shortest paths between the pairs of vertices distributed over the
network? This distribution is characterized by betweenness. Let the total
number of the shortest paths between vertices i and j be B(i,j), and let
B(i,m,j) of them pass through vertex m. The betweenness b(m) of the
vertex m is

$$b(m) = \sum_{i \neq j} \frac{B(i,m,j)}{B(i,j)}.$$

In simple terms, up to normalization, this is the probability that a shortest
path between a pair of vertices of a network passes through the vertex m. In a
similar way one can define an edge betweenness. Unlike degree, the value of the
betweenness of a vertex reflects the topology of the entire graph. Evidently,
vertices (edges) with high betweenness (a high fraction of passing shortest
paths) play an especially important role in a network.
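
For small graphs the betweenness can be computed directly from its definition. The sketch below is a straightforward, unoptimized illustration: it assumes an undirected, unweighted graph given as an adjacency dictionary, sums over ordered pairs (i, j), and excludes paths that start or end at m. These conventions, and the function names, are assumptions of the example.

```python
from collections import deque

def count_shortest_paths(adj, source):
    """BFS from `source`: distance to, and number of shortest paths to, each vertex."""
    dist = {source: 0}
    paths = {source: 1}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in dist:
                dist[w] = dist[u] + 1
                paths[w] = 0
                queue.append(w)
            if dist[w] == dist[u] + 1:
                paths[w] += paths[u]
    return dist, paths

def betweenness(adj):
    """b(m) = sum over ordered pairs i != j (both different from m) of B(i,m,j)/B(i,j)."""
    info = {v: count_shortest_paths(adj, v) for v in adj}
    b = {v: 0.0 for v in adj}
    for i in adj:
        dist_i, paths_i = info[i]
        for j in dist_i:
            if j == i:
                continue
            for m in dist_i:
                if m == i or m == j:
                    continue
                dist_m, paths_m = info[m]
                # m lies on a shortest i-j path if and only if the distances add up.
                if j in dist_m and dist_i[m] + dist_m[j] == dist_i[j]:
                    b[m] += paths_i[m] * paths_m[j] / paths_i[j]
    return b

star = {0: [1, 2, 3], 1: [0], 2: [0], 3: [0]}
square = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
b_star = betweenness(star)
b_square = betweenness(square)
print(b_star)
print(b_square)
```

In the star every shortest path between two leaves passes through the centre, whereas in the square the two equally short routes between opposite corners share the load.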

42 Extracting communities 

In the Girvan-Newman algorithm (see, e.g., Ref. [23]), edges with maximal
betweenness in the network are deleted one by one. This deletion changes
the structure of the shortest paths in the network, and so the betweenness
of each edge is recalculated after each deletion. At some step, the network
turns out to be divided into two clusters (the two largest communities), and so
on. The result is a tree, where smaller communities are included in larger
ones. The distribution of the sizes of the resulting communities in many real
networks is a power law.
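
One splitting step of this procedure can be sketched as follows. The code is a simple illustration for small, simple, undirected graphs, not an efficient implementation: the edge betweenness is computed by brute force from shortest-path counts, and all function names are choices made for the example.

```python
from collections import deque

def bfs_counts(adj, source):
    """Distances from `source` and numbers of shortest paths to each vertex."""
    dist, paths, queue = {source: 0}, {source: 1}, deque([source])
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in dist:
                dist[w], paths[w] = dist[u] + 1, 0
                queue.append(w)
            if dist[w] == dist[u] + 1:
                paths[w] += paths[u]
    return dist, paths

def edge_betweenness(adj):
    """For each edge, sum over ordered pairs (s, t) of the fraction of shortest
    s-t paths that run along this edge."""
    info = {v: bfs_counts(adj, v) for v in adj}
    score = {}
    for u in adj:
        for v in adj[u]:                      # the edge traversed as u -> v
            dist_v, paths_v = info[v]
            total = 0.0
            for s in adj:
                dist_s, paths_s = info[s]
                if u not in dist_s:
                    continue
                for t in dist_v:
                    if t in dist_s and dist_s[u] + 1 + dist_v[t] == dist_s[t]:
                        total += paths_s[u] * paths_v[t] / paths_s[t]
            edge = frozenset((u, v))
            score[edge] = score.get(edge, 0.0) + total
    return score

def components(adj):
    """List of connected components, each given as a set of vertices."""
    seen, parts = set(), []
    for start in adj:
        if start in seen:
            continue
        seen.add(start)
        part, queue = {start}, deque([start])
        while queue:
            x = queue.popleft()
            for w in adj[x]:
                if w not in seen:
                    seen.add(w)
                    part.add(w)
                    queue.append(w)
        parts.append(part)
    return parts

def girvan_newman_split(adj):
    """Delete maximal-betweenness edges one by one until a component splits."""
    adj = {v: set(ns) for v, ns in adj.items()}   # work on a copy
    start = len(components(adj))
    while len(components(adj)) == start and any(adj.values()):
        score = edge_betweenness(adj)             # recalculated after each deletion
        u, v = max(score, key=score.get)
        adj[u].discard(v)
        adj[v].discard(u)
    return components(adj)

# Two triangles joined by the single edge 2-3.
graph = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2, 4, 5], 4: [3, 5], 5: [3, 4]}
communities = girvan_newman_split(graph)
print(communities)
```

Applying the same function recursively to each resulting cluster would produce the tree of nested communities mentioned above.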

43 Optimal paths 

A network shows the small-world effect if the mean shortest-path length
$\bar{\ell}$ grows slower than any positive power of the number of vertices, $N$. Let us
remove a fraction of vertices or edges from a network with the small-world
effect, so that we approach the percolation threshold from above. In other
words, let us nearly eliminate the giant connected component of the network.
The mean shortest-path length is determined by the shortest paths
between vertices in the giant connected component (in total, $N_g$ vertices).
It turns out that near the point of the birth of the giant component, the
$\bar{\ell}(N_g)$ dependence is a power law.

Instead of removing edges, one may ascribe them random weights and
consider the optimal paths between vertices. Here the optimal path is the
shortest one taking into account the weights of edges. If the disorder (the
variation of the weights) is strong, the size dependence of the optimal path
length becomes a power law [24]. This is a way to eliminate the small-world effect.
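
The difference between the shortest and the optimal path can be illustrated with Dijkstra's algorithm. The sketch below assumes, purely for illustration, edge weights of the form exp(a r) with a fixed number r per edge, so that a large value of a mimics strong disorder; this parametrization, the tiny test graph and the function names are assumptions of the example.

```python
import heapq
import math

def optimal_path(adj, source, target):
    """Dijkstra's algorithm. adj[u] is a dict {v: weight}; the target is
    assumed to be reachable from the source."""
    best = {source: 0.0}
    previous = {}
    done = set()
    heap = [(0.0, source)]
    while heap:
        cost, u = heapq.heappop(heap)
        if u in done:
            continue
        done.add(u)
        if u == target:
            break
        for v, w in adj[u].items():
            new_cost = cost + w
            if v not in best or new_cost < best[v]:
                best[v] = new_cost
                previous[v] = u
                heapq.heappush(heap, (new_cost, v))
    path = [target]
    while path[-1] != source:
        path.append(previous[path[-1]])
    return path[::-1], best[target]

def weighted(edges, a):
    """Build an undirected weighted graph with weights exp(a * r)."""
    adj = {}
    for u, v, r in edges:
        w = math.exp(a * r)
        adj.setdefault(u, {})[v] = w
        adj.setdefault(v, {})[u] = w
    return adj

# A direct edge 0-3 with a large r and a three-step detour with small r.
edges = [(0, 3, 0.9), (0, 1, 0.1), (1, 2, 0.2), (2, 3, 0.3)]
weak_path, _ = optimal_path(weighted(edges, a=1.0), 0, 3)
strong_path, _ = optimal_path(weighted(edges, a=20.0), 0, 3)
print(weak_path, strong_path)
```

For weak disorder the optimal path coincides with the one-step shortest path, while for strong disorder it avoids the heaviest edge at the cost of more steps.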

44 Distributions of the shortest-path length and of the loop's 
length are narrow 

One can show that, with few exceptions, large networks with the small-world
effect have a very narrow distribution of the shortest-path length. As the
size of a network grows to infinity, the ratio of the width of the distribution of
the shortest-path length to the mean shortest-path length approaches zero.
That is, in the thermodynamic limit, this distribution is a delta function.

The same is valid for the distribution of the loop's length in large networks
with the small-world effect [25].

45 Diffusion on networks 

Thus, vertices of large networks are almost surely mutually equidistant.
This claim has a number of immediate consequences. For example, consider
a diffusion process on a large network. A particle, which initially was at
vertex 0, jumps from vertex to vertex. At what distance from vertex 0
will the particle be at infinite time? The mutual equidistance of vertices
guarantees that, with high probability, the particle will be at the distance
$\bar{\ell}$ from the starting vertex.

Note that two different diffusion problems can be considered on networks,
but for both of them the above claim is valid. In the first problem, the
probability that the particle leaves a vertex per time step is fixed. Then
at infinite time, the probability that the particle is found at a vertex is
proportional to the degree of this vertex. In the second diffusion problem,
the probability that the particle moves to a given nearest neighbour per time
step is fixed. Then, finally, the particle is found at each vertex of the net
with equal probability.
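
The two limiting distributions can be checked on a small graph by iterating the probabilities step by step. This is a minimal sketch: the four-vertex graph, the leaving probability 0.5, the per-neighbour hopping probability 0.25 and the function name `stationary` are illustrative assumptions.

```python
def stationary(adj, hop_probability, steps=2000):
    """Evolve the occupation probabilities; hop_probability(u) is the chance
    per time step of jumping from u to one given nearest neighbour."""
    prob = {v: 0.0 for v in adj}
    prob[next(iter(adj))] = 1.0          # the particle starts at the first vertex
    for _ in range(steps):
        new = {v: 0.0 for v in adj}
        for u, neighbours in adj.items():
            hop = hop_probability(u)
            new[u] += prob[u] * (1.0 - hop * len(neighbours))   # particle stays
            for w in neighbours:
                new[w] += prob[u] * hop                          # particle jumps
        prob = new
    return prob

graph = {0: [1, 2, 3], 1: [0, 2], 2: [0, 1], 3: [0]}   # degrees 3, 2, 2, 1

# Problem 1: the probability of leaving a vertex is fixed (here 0.5).
first = stationary(graph, lambda u: 0.5 / len(graph[u]))
# Problem 2: the probability of moving to a given neighbour is fixed (here 0.25).
second = stationary(graph, lambda u: 0.25)

print(first)
print(second)
```

In the first case the probabilities approach 3/8, 2/8, 2/8 and 1/8, that is, they are proportional to the vertex degrees; in the second case they all approach 1/4.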



46 What is modularity?

The 'modularity' and the 'modular structure' of networks are frequently
used terms, but what do they precisely mean? Assume that a network may
be divided into modules labeled by an index i, so that the full set of modules
is {i}. Let $e_{ij}$ be the fraction of edges in the network that connect modules
i and j. Then $e_{ii}$ is the fraction of edges in the network that are inside
module i, and $a_i = \sum_j e_{ij}$ is the fraction of edges that are attached to
the vertices of module i. Note that the sum includes the term with j = i. The
modularity [23] for this specific division of a network into modules is

$$M = \sum_i \left( e_{ii} - a_i^2 \right). \qquad (10)$$

One can check that M can take values between 0 and 1. If edges connect
vertices irrespective of this division into modules, then M = 0. With
increasing M, the division into modules becomes more pronounced.

Note that the modularity (10) is defined only for a given set of modules 
of a network. Another division of the network produces a different value of 
the modularity. In principle, one can define the modularity of a network as 
the maximum value of M for all possible sets of modules. Unfortunately, 
the computation of this maximum is a really hard problem. 
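
For a given division, the modularity is straightforward to evaluate. The sketch below assumes an undirected edge list and the convention that an edge between two different modules contributes one half of its weight to each of them, so that the quantities $a_i$ sum to one; this convention and the function name are assumptions of the example.

```python
def modularity(edges, module_of):
    """M = sum_i (e_ii - a_i^2) for an undirected edge list and a map
    vertex -> module label."""
    n_edges = len(edges)
    inside = {}   # e_ii: fraction of edges with both ends in module i
    ends = {}     # a_i: fraction of edge ends attached to module i
    for u, v in edges:
        mu, mv = module_of[u], module_of[v]
        if mu == mv:
            inside[mu] = inside.get(mu, 0.0) + 1.0 / n_edges
        for m in (mu, mv):
            ends[m] = ends.get(m, 0.0) + 0.5 / n_edges
    return sum(inside.get(m, 0.0) - a * a for m, a in ends.items())

# Two triangles joined by the single edge 2-3.
edges = [(0, 1), (0, 2), (1, 2), (2, 3), (3, 4), (3, 5), (4, 5)]
two_modules = {0: "A", 1: "A", 2: "A", 3: "B", 4: "B", 5: "B"}
one_module = {v: "A" for v in range(6)}

print(modularity(edges, two_modules))   # 2 * (3/7 - 1/4) = 5/14
print(modularity(edges, one_module))    # 0 when everything is one module
```

Maximizing this quantity over all divisions would require, in a naive approach, evaluating it for every possible assignment of vertices to modules, which illustrates why the maximum is hard to compute.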

47 Hierarchical organization of networks 

This is another popular and yet poorly defined term. One of the possible ways
to characterize the hierarchical structure of a network is based on the notion
of a hierarchical path [26].

A path between nodes a and b is hierarchical if (1) the degrees of vertices
along this path vary monotonically from one vertex to the other or (2)
vertex degrees first monotonically grow, reach a maximum value, and then
monotonically decrease along the path. Let the fraction H of the shortest
paths in a network be hierarchical. Then this number H can be used as a
metric of a hierarchical topology [27]. If H of a network is sufficiently close
to 1, the network has a pronounced hierarchical organization. For example,
the Internet at the Autonomous Systems level has H ≈ 0.95.
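
The test for a hierarchical path reduces to checking that the sequence of degrees along the path has a single peak. The sketch below assumes that equal consecutive degrees are allowed (non-strict monotonicity), which the definition above leaves open; the function names and the toy data are illustrative.

```python
def is_hierarchical(path_degrees):
    """True if the degrees never decrease up to a single peak and never
    increase after it (a purely growing or purely decaying run also counts)."""
    i = 0
    last = len(path_degrees) - 1
    while i < last and path_degrees[i] <= path_degrees[i + 1]:
        i += 1      # climbing (or flat) part
    while i < last and path_degrees[i] >= path_degrees[i + 1]:
        i += 1      # descending (or flat) part
    return i == last

def hierarchical_fraction(paths, degree):
    """Fraction H of the supplied paths that are hierarchical."""
    flags = [is_hierarchical([degree[v] for v in path]) for path in paths]
    return sum(flags) / len(flags)

degree = {"a": 1, "b": 4, "c": 2}
paths = [["a", "b", "c"], ["c", "a", "b"]]
print(hierarchical_fraction(paths, degree))  # 0.5
```

To obtain H for a network, `paths` would have to contain its shortest paths, which this sketch takes as given.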

In a number of papers, a specific degree dependence of the local clustering
C(k) was treated as a direct indicator of the hierarchical organization of a
network. We believe that in general, this association is incorrect. At least,
the form of C(k) does not correlate with the value of H.

48 Convincing modelling of real-world networks: Is it possible?

We have indicated several different ways to generate networks with complex
architectures. But the aim is to explain how the complex architectures of
real networks emerge. In principle, by using a sufficiently large number
of fitting parameters of, e.g., self-organizing models, one can satisfactorily
describe a set of empirical characteristics of real networks. However, this
'successful' description has no explanatory power.

A really convincing model of a real network must explain a sufficiently
large set of empirical data without fitting. The model parameters must be
expressed in terms of only known basic numbers of a network (e.g., the total
numbers of vertices and edges, etc.), some input rates (e.g., the rate of
network growth), or known 'external' factors if they
influence the evolution of networks. Thus, to be convincing, one has to
avoid fitting. The question is: is it possible to describe such complex systems
without any fitting?

49 The 'small' Web 

We stressed that even large real-world networks are mesoscopic objects.
Even the extremely large Web contains only about $10^{10}$ (sufficiently 'static')
pages. Moreover, the total volume of information on the Web is not so
great: only about 200 Terabytes on the 'surface', 'static' Web. It is this
Web that is explored by search engines. Due to the smallness of the surface
Web, Google can store a large number (many dozens) of copies of the Web on
the hard drives of Google servers.

50 The failures and perspectives of the physics approach to 
complex networks 

Networks are widespread objects with properties remarkably different from
those of lattices. Physicists in the science of networks use traditional,
effective methods of statistical mechanics and are involved in the empirical
research of real-world networks. This has allowed them to find new classes
of networks, very common in the real world, and understand a number of
their features.

The problem is that this new understanding of complex networks by
physicists has not yet produced significant practical results. We believe that
a more practical, applied direction provides an encouraging perspective for
network research.

51 A remark about references 

The list of references mostly contains large reviews and reference books.
Readers can find a detailed bibliography in these sources. In addition,
several quite recent papers are included in the list. For a more detailed and
systematic introduction to the topic of complex networks, readers may
refer to our book [6].